RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 



Source: 



Date Processed by STIC: 3-9-01



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:

Checker Version 3.0

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K-compliant.
Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.

Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker 




ERROR DETECTED           SUGGESTED CORRECTION

SERIAL NUMBER: 09/430,590C



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

Wrapped Nucleics         The number/text at the end of each line "wrapped" down to the next line.
                         This may occur if your file was retrieved in a word processor after creating it.
                         Please adjust your right margin to prevent "wrapping".

Wrapped Aminos           The amino acid number/text at the end of each line "wrapped" down to the next line.
                         This may occur if your file was retrieved in a word processor after creating it.
                         Please adjust your right margin to prevent "wrapping".



Incorrect Line Length    The rules require that a line not exceed 72 characters in length. This includes spaces.

Misaligned Amino Acid    The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs
Numbering                between the numbering. It is recommended to delete any tabs and use spacing between the numbers.

Non-ASCII                This file was not saved in ASCII (DOS) text, as required by the Sequence Rules.
                         Please ensure your subsequent submission is saved in ASCII text so that it can be processed.

Variable Length          Sequence(s) 1, 2 contain n's or Xaa's which represented more than one residue.
                         As per the rules, each n or Xaa can only represent a single residue.
                         Please present the maximum number of each residue having variable length and
                         indicate in the (ix) feature section that some may be missing.

PatentIn ver. 2.0 "bug"  A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
                         sequence(s). Normally, PatentIn would automatically generate this section from the
                         previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section
                         to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223>
                         sections for Artificial or Unknown sequences.

Skipped Sequences        Sequence(s) missing. If intentional, please use the following format for each skipped sequence:
(OLD RULES)
                         (2) INFORMATION FOR SEQ ID NO:X:
                         (i) SEQUENCE CHARACTERISTICS: (Do not insert any headings under "SEQUENCE CHARACTERISTICS")
                         (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X:
                         This sequence is intentionally skipped

                         Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s).

Skipped Sequences        Sequence(s) missing. If intentional, please use the following format for each skipped sequence.
(NEW RULES)
                         <210> sequence id number
                         <400> sequence id number
                         000



10  Use of n's or Xaa's      Use of n's and/or Xaa's have been detected in the Sequence Listing.
    (NEW RULES)              Use of <220> to <223> is MANDATORY if n's or Xaa's are present.
                             In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

11  Use of <213> Organism    Sequence(s) ____ are missing this mandatory field or its response.
    (NEW RULES)

12  Use of <220> Feature     Sequence(s) ____ are missing the <220> Feature and associated headings.
    (NEW RULES)              Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial" or "Unknown".
                             Please explain source of genetic material in <220> to <223> section.
                             (See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules)

13  PatentIn ver. 2.0 "bug"  Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted
                             file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing).
                             Instead, please use "File Manager" or any other means to copy file to floppy disk.
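
Several of the formatting errors listed above (line length over 72 characters, tabs in the numbering, non-ASCII text) can be screened for before submission. The following is only a rough sketch of such a pre-check, not the Checker program itself; the function names precheck_lines and skipped_sequence_block are hypothetical, and the second function simply emits the new-rules placeholder format shown above for an intentionally skipped sequence.

```python
def precheck_lines(text, max_len=72):
    """Return (line_number, problem) pairs for three formatting errors.

    Illustrative sketch only: it covers the 72-character limit, tab
    characters, and non-ASCII characters, and nothing else.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        # The limit counts every character on the line, spaces included.
        if len(line) > max_len:
            problems.append((number, "line exceeds %d characters" % max_len))
        # Tabs are a reported cause of misaligned amino acid numbering.
        if "\t" in line:
            problems.append((number, "tab character found; use spaces"))
        # The listing must be saved as plain ASCII text.
        if any(ord(ch) > 127 for ch in line):
            problems.append((number, "non-ASCII character found"))
    return problems


def skipped_sequence_block(seq_id):
    """Build the new-rules placeholder for an intentionally skipped sequence."""
    return "<210> %d\n<400> %d\n000\n" % (seq_id, seq_id)
```

A listing that yields an empty list from precheck_lines may still fail other checks; the sketch makes no attempt to validate identifiers or sequence content.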



AKS - Biotechnology Systems Branch - 5/15/99






1636 



RAW SEQUENCE LISTING DATE: 03/09/2001
PATENT APPLICATION: US/09/430,590C TIME: 11:58:25

Input Set : A:\ES.txt
Output Set: N:\CRF3\03092001\I430590C.raw

Does Not Comply
Corrected Diskette Needed

3 <110> APPLICANT: Poulter, et al.
5 <120> TITLE OF INVENTION: UNUSUAL RETROTRANSPOSON FROM THE YEAST CANDIDA ALBICANS
7 <130> FILE REFERENCE: 674521*2001.1
9 <140> CURRENT APPLICATION NUMBER: 09/430,590C
10 <141> CURRENT FILING DATE: 1999-10-29
12 <150> PRIOR APPLICATION NUMBER: 60/106,342
13 <151> PRIOR FILING DATE: 1998-10-30
15 <160> NUMBER OF SEQ ID NOS: 156
17 <170> SOFTWARE: PatentIn version 3.0

19 <210> SEQ ID NO: 1
20 <211> LENGTH: 13
21 <212> TYPE: DNA
22 <213> ORGANISM: Candida albicans
24 <220> FEATURE:
25 <221> NAME/KEY: misc_feature
26 <222> LOCATION: (7)..(7)
27 <223> OTHER INFORMATION: nucleotide 'n' represents the 376 nucleotide sequence in between
28 the inverted repeats of the LTR of TCal in Candida albicans
31 <300> PUBLICATION INFORMATION:
32 <308> DATABASE ACCESSION NO: AF043301
33 <309> DATABASE ENTRY DATE: 1998-07-21
34 <313> RELEVANT RESIDUES: (1)..(13)
36 <400> SEQUENCE: 1
W--> 37 tgttcgncga aca      13

40 <210> SEQ ID NO: 2
41 <211> LENGTH: 19
42 <212> TYPE: DNA
43 <213> ORGANISM: Candida albicans
45 <220> FEATURE:
46 <221> NAME/KEY: misc_feature
47 <222> LOCATION: (10)..(10)
48 <223> OTHER INFORMATION: nucleotide 'n' represents the 382 nucleotide sequence in between
49 the inverted repeats of the LTR of another element in Candida alb
50 icans
53 <300> PUBLICATION INFORMATION:
54 <308> DATABASE ACCESSION NO: Y08494
55 <309> DATABASE ENTRY DATE: 1997-08-27
56 <313> RELEVANT RESIDUES: (1)..(19)
58 <400> SEQUENCE: 2
W--> 59 taatgtatan tatacaaca      19

62 <210> SEQ ID NO: 3
63 <211> LENGTH: 6426
64 <212> TYPE: DNA
65 <213> ORGANISM: Candida albicans
67 <220> FEATURE:
68 <221> NAME/KEY: CDS
69 <222> LOCATION: (398)..(1372)



RECEIVED
MAR 22 2001
TECH CENTER 1600/2900
70 <223> OTHER INFORMATION: ORF1 coding sequence for gag
73 <220> FEATURE:
74 <221> NAME/KEY: CDS
75 <222> LOCATION: (1373)..(6103)
76 <223> OTHER INFORMATION: ORF2 - coding sequence for pol
79 <400> SEQUENCE: 3
80 tgttggtttg tgcactattt tgtgtcagaa actgatcaat gaaaatgatg gttattatga      60
82 gaatggaaaa tttttccatc acacatcagg tgatgacaga actaaactat attgtgtagt      120
84 ataaataagg gtatgaaata ccaacatccc agaatatcaa cgagatagaa gggaggagtt      180
86 tcaatatata tcttgtgaat aataacttcg ttctaattca ctatacacaa ctagacgtgt      240
88 acacgctcaa tctcaggtaa agaaagttta tattccatca gattagaagt cgatagtgat      300
90 aatcatttcg tcccaaatta gcgttgtata aattcagtcc tcagatttgt attattgatt      360
92 gatagtttcg aagtttgaag gtacagaatt tcacaag atg agt tcc gca aag aat      415
93                                          Met Ser Ser Ala Lys Asn
94                                          1               5

96 gat gat aac gaa ggg aag gtc atg gaa agt gtt gat caa gct aat gct      463
97 Asp Asp Asn Glu Gly Lys Val Met Glu Ser Val Asp Gln Ala Asn Ala
98             10                  15                  20

100 att agt aag gtg gat gaa cat atc aag gct aga ttc aat atg ctt ttc      511
101 Ile Ser Lys Val Asp Glu His Ile Lys Ala Arg Phe Asn Met Leu Phe
102         25                  30                  35

104 ata aaa ttt aat gac tta cct aag ttg gcc gtc ggt aat cag aaa agc      559
105 Ile Lys Phe Asn Asp Leu Pro Lys Leu Ala Val Gly Asn Gln Lys Ser
106     40                  45                  50

108 gtg gat aaa tgg aat gaa gaa ttt aaa tat ttc cac gtt gct tac ccc      607
109 Val Asp Lys Trp Asn Glu Glu Phe Lys Tyr Phe His Val Ala Tyr Pro
110 55                  60                  65                  70

112 gat gtt ttg gaa ttt ttg ctt gac tat aat cct aaa gat aaa ttc aag      655
113 Asp Val Leu Glu Phe Leu Leu Asp Tyr Asn Pro Lys Asp Lys Phe Lys
114                 75                  80                  85

116 gtt aaa aag gta gaa ggt att tat ttt act ggt tgg tgt tta caa atg      703
117 Val Lys Lys Val Glu Gly Ile Tyr Phe Thr Gly Trp Cys Leu Gln Met
118             90                  95                  100

120 tgt tta cag tcc att ttt gat agg ttc aga ttg atc atg att tct aag      751
121 Cys Leu Gln Ser Ile Phe Asp Arg Phe Arg Leu Ile Met Ile Ser Lys
122         105                 110                 115

124 cta cca aag cac ttg caa aag gaa gca aac tta atc aaa gct gct tat      799
125 Leu Pro Lys His Leu Gln Lys Glu Ala Asn Leu Ile Lys Ala Ala Tyr
126     120                 125                 130

128 gat gct gtt act aaa tct aaa gat tat acc att act agt aag atc ttg      847
129 Asp Ala Val Thr Lys Ser Lys Asp Tyr Thr Ile Thr Ser Lys Ile Leu
130 135                 140                 145                 150

132 ctg aag ttt gta aac gtt gaa cat gag tta gtg gtt tgc tat aac ctt      895
133 Ser Lys Phe Val Asn Val Glu His Glu Leu Val Val Cys Tyr Asn Leu
134                 155                 160                 165

136 cca tat ttg ctg cag gtg gaa gag aaa ctt gag gaa ata ctc tac aac      943
137 Pro Tyr Leu Ser Gln Val Glu Glu Lys Leu Glu Glu Ile Leu Tyr Asn
138             170                 175                 180

140 act tca aac gtt gtc gat gag tat gtc cgt agt ctt cca aat ctc ata      991
141 Thr Ser Asn Val Val Asp Glu Tyr Val Arg Ser Leu Pro Asn Leu Ile
142         185                 190                 195

144 ggt caa gtc ttg tac ttc aat cat gtg aag aaa tca gag gct tta agt      1039
145 Gly Gln Val Leu Tyr Phe Asn His Val Lys Lys Ser Glu Ala Leu Ser
146     200                 205                 210

148 ttg ttt ttg aat att cat gcc tca tac tac tca aag tgg att caa gct      1087
149 Leu Phe Leu Asn Ile His Ala Ser Tyr Tyr Ser Lys Trp Ile Gln Ala
150 215                 220                 225                 230

152 gac aat gat aca tca gta ctc cca agt tgc tct acc ata gct gaa gaa      1135
153 Asp Asn Asp Thr Ser Val Leu Pro Ser Cys Ser Thr Ile Ala Glu Glu
154                 235                 240                 245

156 atg tgt gat cat cct gat tat gct aga ttg gtt gac att cca agc aac      1183
157 Met Cys Asp His Pro Asp Tyr Ala Arg Leu Val Asp Ile Pro Ser Asn
158             250                 255                 260

160 aaa tat gaa ctt aat ctt att gtt agt tta cca gca cca gag aaa cca      1231
161 Lys Tyr Glu Leu Asn Leu Ile Val Ser Leu Pro Ala Pro Glu Lys Pro
162         265                 270                 275

164 aaa gga aaa cca gag gag aac tca ctg gaa caa tct caa aag aag aac      1279
165 Lys Gly Lys Pro Glu Glu Asn Ser Ser Glu Gln Ser Gln Lys Lys Asn
166     280                 285                 290

168 ctg aaa tca aga aag aga aat aag aaa cat cca aaa tca gat aac gat      1327
169 Ser Lys Ser Arg Lys Arg Asn Lys Lys His Pro Lys Ser Asp Asn Asp
170 295                 300                 305                 310

172 aaa ggt gaa aaa gaa aaa gaa aaa gaa aaa act tca ctg gaa tga aaa      1375
173 Lys Gly Glu Lys Glu Lys Glu Lys Glu Lys Thr Ser Ser Glu     Lys
174                 315                 320                     325

176 aca ggt gct gct tct att aat tgt gta atg aat ata cat aat tgc agc      1423
177 Thr Gly Ala Ala Ser Ile Asn Cys Val Met Asn Ile His Asn Cys Ser
178                 330                 335                 340

180 aaa acc acg ttt cca gta gaa aat tct cat tct ctt aat gct tct ttg      1471
181 Lys Thr Thr Phe Pro Val Glu Asn Ser His Ser Leu Asn Ala Ser Leu
182             345                 350                 355

184 aac gta atg aat ttt aaa ggt tta agg ttt aac aag tat cta gtg tat      1519
185 Asn Val Met Asn Phe Lys Gly Leu Arg Phe Asn Lys Tyr Leu Val Tyr
186         360                 365                 370

188 gat act ggt gcc aca ata tct gtt gtg aac aat aaa gat ata ttg ctg      1567
189 Asp Thr Gly Ala Thr Ile Ser Val Val Asn Asn Lys Asp Ile Leu Ser
190     375                 380                 385

192 aat gtt aag gac gca aca att gaa gtt tct gtt gct gat ggt gct aca      1615
193 Asn Val Lys Asp Ala Thr Ile Glu Val Ser Val Ala Asp Gly Ala Thr
194 390                 395                 400                 405

196 tta gaa gca gat tgt att ggt gat cta att atc aga gtc ggt att gtc      1663
197 Leu Glu Ala Asp Cys Ile Gly Asp Leu Ile Ile Arg Val Gly Ile Val
198                 410                 415                 420

200 tcg att acg tta gag aat aca ttg tat tta cca gaa agt tcc ttt aat      1711
201 Ser Ile Thr Leu Glu Asn Thr Leu Tyr Leu Pro Glu Ser Ser Phe Asn
202             425                 430                 435

204 ctt gtg agt ttg aaa caa att gaa gaa cga gga ttt aat gtt ctt att      1759
205 Leu Val Ser Leu Lys Gln Ile Glu Glu Arg Gly Phe Asn Val Leu Ile
206         440                 445                 450

208 act aaa gaa tca gtg att gta ttt aac caa aat gtg gct cct act att      1807
209 Thr Lys Glu Ser Val Ile Val Phe Asn Gln Asn Val Ala Pro Thr Ile
210     455                 460                 465

212 att gct tca agg aag aat gct gct gat ctt tat atg ggt cct caa ttc      1855
213 Ile Ala Ser Arg Lys Asn Ala Ala Asp Leu Tyr Met Gly Pro Gln Phe
214 470                 475                 480                 485

216 agt gaa gaa tct tta gaa tgt gat ttt gat tat gat ggt ttg gca gat      1903
217 Ser Glu Glu Ser Leu Glu Cys Asp Phe Asp Tyr Asp Gly Leu Ala Asp
218                 490                 495                 500

220 atg ttg tcc aat gct aac caa gat gac aaa gat aaa tca agt atg aat      1951
221 Met Leu Ser Asn Ala Asn Gln Asp Asp Lys Asp Lys Ser Ser Met Asn
222             505                 510                 515

224 gaa atg tca gaa tat caa gaa cat gat tat agt tct cga gca tta ata      1999
225 Glu Met Ser Glu Tyr Gln Glu His Asp Tyr Ser Ser Arg Ala Leu Ile
226         520                 525                 530

228 aat tct ttg acg gag gtt gat gtt tta gat gtt gaa att tcc cca tat      2047
229 Asn Ser Leu Thr Glu Val Asp Val Leu Asp Val Glu Ile Ser Pro Tyr
230     535                 540                 545

232 gga gtt gaa caa ttg cta cca act gga gat aag aac gat att tat aat      2095
233 Gly Val Glu Gln Leu Leu Pro Thr Gly Asp Lys Asn Asp Ile Tyr Asn
234 550                 555                 560                 565

236 ttc cat ttg atg tca aat cat atg tcc att gag aaa atc ttg ttg tta      2143
237 Phe His Leu Met Ser Asn His Met Ser Ile Glu Lys Ile Leu Leu Leu
238                 570                 575                 580

240 caa aaa tac cag ggt ctc gta ctt cac act tca aaa gag agt ctt caa      2191
241 Gln Lys Tyr Gln Gly Leu Val Leu His Thr Ser Lys Glu Ser Leu Gln
242             585                 590                 595

244 aag att gct gat tgt aag gta tgt cta tta tcg aat gcc aaa cag aga      2239
245 Lys Ile Ala Asp Cys Lys Val Cys Leu Leu Ser Asn Ala Lys Gln Arg
246         600                 605                 610

248 agt cac aat cat cat tca gaa aga aaa gcc tcg aga aga cat gag aga      2287
249 Ser His Asn His His Ser Glu Arg Lys Ala Ser Arg Arg His Glu Arg
250     615                 620                 625

252 ctt cat tgt gat act ctc ggt cca ttt agg tcc gaa aat aac aag tgg      2335
253 Leu His Cys Asp Thr Leu Gly Pro Phe Arg Ser Glu Asn Asn Lys Trp
254 630                 635                 640                 645

256 tat tta acg tct gtt ata gat gaa cat acg ggt tac att gaa gga att      2383
257 Tyr Leu Thr Ser Val Ile Asp Glu His Thr Gly Tyr Ile Glu Gly Ile
258                 650                 655                 660

260 att act aaa gac aga aag gta aag gat ctc tta att caa cga tta aag      2431
261 Ile Thr Lys Asp Arg Lys Val Lys Asp Leu Leu Ile Gln Arg Leu Lys
262             665                 670                 675

264 atc tgg aat aat cgg ttt aac gat aag gtg gca tac ttc aga agt gat      2479
265 Ile Trp Asn Asn Arg Phe Asn Asp Lys Val Ala Tyr Phe Arg Ser Asp
266         680                 685                 690

268 aat gct cct gag ttc cca caa cct tct gat tta gct gag ttc ggt att      2527
269 Asn Ala Pro Glu Phe Pro Gln Pro Ser Asp Leu Ala Glu Phe Gly Ile
270     695                 700                 705
272 tgg agg gag act ata gcg gca tat ctg cct gag ctt aat ggt ctc gcc      2575
273 Trp Arg Glu Thr Ile Ala Ala Tyr Ser Pro Glu Leu Asn Gly Leu Ala
274 710                 715                 720                 725

276 gag gtt gtt aat aaa ttg att tta caa cag att tac agg atc gtt gtg      2623
277 Glu Val Val Asn Lys Leu Ile Leu Gln Gln Ile Tyr Arg Ile Val Val
278                 730                 735                 740

280 aca ctt ggt cca caa ata ctc aag ttg att tat tat gtg att caa tat      2671
281 Thr Leu Gly Pro Gln Ile Leu Lys Leu Ile Tyr Tyr Val Ile Gln Tyr
282             745                 750                 755

284 tct att aca atg atc aac cac act cca cgt cgt tca ctc aag gga caa      2719
285 Ser Ile Thr Met Ile Asn His Thr Pro Arg Arg Ser Leu Lys Gly Gln
286         760                 765                 770

288 acc cct tat ggt tgc tat tat caa tta agt gag gga aat ttc tac cgg      2767
289 Thr Pro Tyr Gly Cys Tyr Tyr Gln Leu Ser Glu Gly Asn Phe Tyr Arg
290     775                 780                 785

292 ttt cct ttt gcc atc gat tgt gtc gtt aca ttt agt aat gcc atc gaa      2815
293 Phe Pro Phe Ala Ile Asp Cys Val Val Thr Phe Ser Asn Ala Ile Glu
294 790                 795                 800                 805

296 aag aac cgt tac gga gtt aca tca act aaa gga gct cct tca tcg atc      2863
297 Lys Asn Arg Tyr Gly Val Thr Ser Thr Lys Gly Ala Pro Ser Ser Ile
298                 810                 815                 820

300 atg ggt gct gtg att ggc tac gct agc gat tgt ttt agt tat tac gtg      2911
301 Met Gly Ala Val Ile Gly Tyr Ala Ser Asp Cys Phe Ser Tyr Tyr Val
302             825                 830                 835

304 ttg cta aaa aat atg cgg tgt gat att atc ctt agc cct aat gtc cgt      2959
305 Leu Leu Lys Asn Met Arg Cys Asp Ile Ile Leu Ser Pro Asn Val Arg
306         840                 845                 850

308 ata ttg cga agc tat gag gtt att aac tcc tat ctc aaa aac tta tcc      3007
309 Ile Leu Arg Ser Tyr Glu Val Ile Asn Ser Tyr Leu Lys Asn Leu Ser
310     855                 860                 865

312 act aca cct atg tca cac att gtt cct atg gct gaa ggt atc cag gga      3055
313 Thr Thr Pro Met Ser His Ile Val Pro Met Ala Glu Gly Ile Gln Gly
314 870                 875                 880                 885

316 agg caa ctg ggc gct cag tac gag gta cgc gga aca tat gtg gaa agt      3103
317 Arg Gln Ser Gly Ala Gln Tyr Glu Val Arg Gly Thr Tyr Val Glu Ser
318                 890                 895                 900

320 gaa tat gac aat aca aat gac gtg atg cac atg ccc aaa gag tca tat      3151
321 Glu Tyr Asp Asn Thr Asn Asp Val Met His Met Pro Lys Glu Ser Tyr
322             905                 910                 915

324 tca gtt cag cca gca tcg ttt act tta act acg ggt aac agt tct aac      3199
325 Ser Val Gln Pro Ala Ser Phe Thr Leu Thr Thr Gly Asn Ser Ser Asn
326         920                 925                 930

328 gaa tat gtt ata aat gat gat cca gta cag att acc att gag aat ccc      3247
329 Glu Tyr Val Ile Asn Asp Asp Pro Val Gln Ile Thr Ile Glu Asn Pro
330     935                 940                 945

332 gat gat ttt tct aac cct ctt caa cta act gaa gaa tca cac gat atg      3295
333 Asp Asp Phe Ser Asn Pro Leu Gln Leu Thr Glu Glu Ser His Asp Met
334 950                 955                 960                 965

336 gta tcc gaa gta aaa tcg gat gag aat cct aaa ccc agt ctc cac gag      3343






Please Note:
Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 
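
The review requested in the note above can be approximated mechanically. The sketch below is a hypothetical helper (seq_ids_missing_explanation is not part of PatentIn or Checker) and assumes a new-rules listing with bare numeric identifiers such as "<210> 1", not the labelled raw-listing printout. It reports each sequence that contains an n or an Xaa after its <400> line but carries no <223> explanation. It does not judge whether an existing explanation is adequate.

```python
import re


def seq_ids_missing_explanation(listing):
    """Return SEQ ID numbers that use n or Xaa but have no <223> field.

    Sketch under stated assumptions: identifiers look like "<210> 1",
    nucleotides are lowercase, and amino acids use three-letter codes.
    """
    records = []      # one dict per sequence, in order of appearance
    current = None
    for line in listing.splitlines():
        start = re.match(r"\s*<210>\s*(\d+)", line)
        if start:
            current = {"id": int(start.group(1)), "feature": False,
                       "in_seq": False, "unknown": False}
            records.append(current)
        elif current is None:
            continue                      # text before the first <210>
        elif "<223>" in line:
            current["feature"] = True     # an explanation is present
        elif "<400>" in line:
            current["in_seq"] = True      # sequence data follows
        elif current["in_seq"]:
            for token in line.split():
                # "Asn" is not all-lowercase, so it is not mistaken for n.
                lowercase_n = token.isalpha() and token.islower() and "n" in token
                if token == "Xaa" or lowercase_n:
                    current["unknown"] = True
    return [r["id"] for r in records if r["unknown"] and not r["feature"]]
```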
L:37 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1
L:59 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:2
L:1152 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:6
L:1176 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1178 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1180 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1182 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1186 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1188 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1190 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1192 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1194 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1208 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1492 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11
L:1604 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13
L:1608 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13
L:1720 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13
L:1722 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13
L:1926 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:14
L:2122 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:16
L:2246 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:17
L:2266 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:18
L:2314 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:18
L:2316 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:18
L:2318 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:18
L:2356 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:19
L:2496 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:21
L:3147 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:25
L:3175 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:25
L:3181 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:25
L:3183 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:25
L:3269 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:26
L:3271 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:26
L:3429 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:28
L:3493 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:28
L:3543 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:28
L:3625 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:29
L:3645 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:30
L:3647 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:30
L:3759 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:31
L:3761 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:31
L:3763 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:31
L:3765 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:31
L:3767 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:31
L:3769 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:31
L:3821 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:32
L:3877 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:33
L:3993 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:33
L:3995 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:33
L:5079 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:85
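
The warning lines above follow a regular pattern, so they can be grouped by sequence to see which entries need attention. This is a minimal sketch with a hypothetical function name, group_warnings_by_sequence. It treats the number after "L:" as the raw-listing line number, which is an assumption based on lines 37 and 59 carrying the W--> flag in the listing. The "M:" value and the parenthesised number are not explained in this report, so the sketch leaves them uninterpreted.

```python
import re
from collections import defaultdict

# Matches lines such as:  L:37 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1
WARNING_LINE = re.compile(r"L:\s*(\d+)\s+M:\s*\d+\s+W:.*?SEQ ID#\s*:\s*(\d+)")


def group_warnings_by_sequence(report):
    """Map each SEQ ID number to the flagged line numbers, in report order."""
    grouped = defaultdict(list)
    for line in report.splitlines():
        match = WARNING_LINE.search(line)
        if match:
            line_number, seq_id = int(match.group(1)), int(match.group(2))
            grouped[seq_id].append(line_number)
    return dict(grouped)
```

Applied to the warnings in this report, the result would list one entry per affected SEQ ID, which gives the applicant a short list of sequences whose <220> to <223> fields should be reviewed.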